Network Performance in High Performance Linux Clusters
نویسندگان
چکیده
Linux-based clusters have become more prevalent as a foundation for High Performance Computing (HPC) systems. With a better understanding of network performance in these environments, we can optimize configurations and develop better management and administration policies to improve operations. To assist in this process, we developed a network measurement tool to measure UDP, TCP and MPI communications over high performance networks, such as Gigabit Ethernet and Myrinet. In this paper, we report on the use of this tool to evaluate the network performance of three high performance interconnects in HPC clusters: Gigabit Ethernet, Myrinet, and Quadrics’ QsNet and discuss the implications of those results for configurations in HPC Linux clusters.
منابع مشابه
Network Performance in Distributed HPC Clusters
Linux-based clusters have become prevalent as a foundation for High Performance Computing (HPC) systems. As these clusters become more affordable and available, and with the emergence of high speed networks, it is becoming more feasible to create HPC grids consisting of multiple clusters. One of the attractions of such grids is the potential to scale applications across the various clusters. Th...
متن کاملParallel computing using MPI and OpenMP on self-configured platform, UMZHPC.
Parallel computing is a topic of interest for a broad scientific community since it facilitates many time-consuming algorithms in different application domains.In this paper, we introduce a novel platform for parallel computing by using MPI and OpenMP programming languages based on set of networked PCs. UMZHPC is a free Linux-based parallel computing infrastructure that has been developed to cr...
متن کاملPerformance Considerations for Network Switch Fabrics on Linux Clusters
One of the most significant components in a cluster is the interconnection network between computational nodes. A majority of today’s clusters use either switched Fast Ethernet, Gigabit Ethernet, or a specialized switch fabric to connect nodes. However, the use of these specialized switch fabrics may not necessarily benefit the users, and in some cases they perform only slightly better than com...
متن کاملPVFS: A Parallel File System for Linux Clusters
As Linux clusters have matured as platforms for lowcost, high-performance parallel computing, software packages to provide many key services have emerged, especially in areas such as message passing and networking. One area devoid of support, however, has been parallel file systems, which are critical for highperformance I/O on such clusters. We have developed a parallel file system for Linux c...
متن کاملEnhancing TCP Performance for Dedicated Clusters and Grids
TCP congestion control methods seriously and unnecessarily harm performance of network transmissions when used in dedicated clusters and grids. We present a simple method in which congestion control can be disabled under appropriate circumstances while still addressing fairness issues and avoiding congestion collapse. We discuss a Linux-based implementation of this “Rude TCP”1 and demonstrate t...
متن کامل